[WIP] Implements RNNT+MMI #1030
base: master
Conversation
@danpovey Do you have any good ideas on how to test this function? I can only think of constructing simple test cases.
k2/csrc/fsa_algo.cu
repeat_num = us_row_splits1_data[us_idx0 + 1] - us_row_splits1_data[us_idx0];
...
arc.score = -logf(1 - powf(1 - sampling_prob, repeat_num));
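For reference, a minimal standalone sketch of what this line seems to compute; my reading (not stated in the PR) is that `1 - (1 - sampling_prob)^repeat_num` is the probability that the symbol is drawn at least once in `repeat_num` independent draws, and the arc score is its negated log:

```python
import math

def sampling_correction_score(sampling_prob: float, repeat_num: int) -> float:
    # Probability that a symbol with per-draw probability `sampling_prob`
    # is drawn at least once in `repeat_num` independent draws.
    q = 1.0 - (1.0 - sampling_prob) ** repeat_num
    # Negated log of that probability, matching the C++ line above.
    return -math.log(q)

print(sampling_correction_score(0.3, 1))  # ~1.204
print(sampling_correction_score(0.3, 2))  # ~0.673, smaller when the symbol repeats
```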
I only include the "predictor" head output in the C++ part; the other two scores (i.e. the hybrid output and lm_output) will be added on the Python side, where it is easier to enable autograd for the hybrid output.
k2/python/k2/fsa_algo.py
a_value = getattr(lattice, "scores")
# Enable autograd for path_scores
b_value = index_select(path_scores.flatten(), arc_map)
value = a_value + b_value
path_scores here will contain hybrid_output and the detached lm_output. I include path_scores here and enable autograd for path_scores.
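A small PyTorch sketch of the autograd behaviour described above (all shapes and names are made up for illustration; the diff uses k2's `index_select`, while the sketch uses `torch.index_select`):

```python
import torch

lattice_scores = torch.randn(10)                      # existing per-arc scores (constants here)
path_scores = torch.randn(3, 4, requires_grad=True)   # per-path scores to attach to arcs
arc_map = torch.randint(0, 12, (10,))                 # maps each arc to an entry of path_scores

# index_select keeps the autograd graph, so gradients flow back into path_scores.
b_value = torch.index_select(path_scores.flatten(), 0, arc_map)
value = lattice_scores + b_value

value.sum().backward()
print(path_scores.grad is not None)  # True: path_scores receives gradients
```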
Yes, OK. Right, we treat those as differentiable, but the negated sampling_prob is treated as just a constant.
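A toy sketch of that distinction, with made-up tensors: the model scores stay differentiable while the sampling-probability correction is detached so it acts as a constant:

```python
import torch

hybrid_scores = torch.randn(10, requires_grad=True)         # differentiable model scores
sampling_prob = torch.full((10,), 0.3, requires_grad=True)   # stand-in predictor probabilities
correction = -torch.log(1.0 - (1.0 - sampling_prob) ** 2)    # stand-in for the C++ formula above

# The correction is detached, so it contributes no gradient to sampling_prob.
arc_scores = hybrid_scores + correction.detach()
arc_scores.sum().backward()

print(hybrid_scores.grad)   # all ones
print(sampling_prob.grad)   # None: treated as just a constant
```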
# index == 0 means the sampled symbol is blank
t_mask = index == 0
# t_index = torch.where(t_mask, t_index + 1, t_index)
t_index = t_index + 1
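For illustration, a toy comparison of the commented-out line and the line that is used (the tensors below are made up): with `torch.where` only blank advances the time index, whereas the unconditional `t_index + 1` advances it for every sampled symbol.

```python
import torch

index = torch.tensor([0, 3, 0, 7])      # sampled symbol ids for four paths; 0 is blank
t_index = torch.tensor([2, 2, 5, 5])    # current frame index of each path

t_mask = index == 0

# Regular RNN-T behaviour (the commented-out line): advance only on blank.
t_regular = torch.where(t_mask, t_index + 1, t_index)

# Behaviour in the diff: every sampled symbol advances the frame index.
t_used = t_index + 1

print(t_regular)  # tensor([3, 2, 6, 5])
print(t_used)     # tensor([3, 3, 6, 6])
```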
If we use regular RNN-T, it is possible to generate too many symbols on a specific frame, and there is then a chance of generating a lattice containing cycles, which is not expected. I am not sure whether we will encounter such an issue at the very beginning of training.
Hm, a valid point. Yes, computing forward-backward scores would not work correctly if there are cycles. One possibility would be to augment the state with a sub-frame, i.e. instead of (ctx, t) it becomes (ctx, t, sub_t) with sub_t = 0, 1, 2, .... That would prevent cycles, although it might prevent a small number of paths from recombining that might otherwise be able to recombine.
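A rough sketch of how that augmented state might behave (the function and names are mine, not from the PR): emitting a non-blank symbol keeps t fixed and bumps sub_t, while blank advances t and resets sub_t, so (t, sub_t) strictly increases along any path and no cycle can form.

```python
def next_state(ctx, t, sub_t, symbol, new_ctx=None, blank=0):
    """Hypothetical transition on the augmented state (ctx, t, sub_t)."""
    if symbol == blank:
        return (ctx, t + 1, 0)        # blank: move to the next frame, reset the sub-frame
    return (new_ctx, t, sub_t + 1)    # non-blank: stay on the frame, bump the sub-frame

state = ("ab", 3, 0)
state = next_state(*state, symbol=5, new_ctx="b5")  # ('b5', 3, 1)
state = next_state(*state, symbol=0)                # ('b5', 4, 0)
print(state)
```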
It runs normally in my self-constructed test case; it is not fully tested yet, though.
The sampled paths:
The corresponding lattice:
Note: there is an arc from state 2 to state 17 in the second lattice because the last symbol of the second path of the second sequence is sampled at frame 1; it is a simulation of reaching the final frame.